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ABSTRACT 

The effects of sex, verbal ability, test anxiety, 
ideational fluency and training procedures on Formulating Hypotheses 
test performance were studied.. Training consisted of presentation of 
models of "acceptable" responses that stressed either quantity or 
quality performance. Both the quantity and quality models were 
effective in modifying behavior in the expected direction. Ideational 
fluency was related to number of responses, and verbal ability was 
related to scores reflecting quality. Females were in general 
superior to males with respect to scores reflecting quantity of 
responses. Test anxiety was not significantly associated with 
performance. Weak evidence of treatment-anxiety and sex-vocabulary 
interactions was found. (Author) 
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EFFECTS OF MODELS OF CREATIVE PERFOR^IANCL 
ON ABILITY TO FORMULATE HYPOTHESES 

Abstract 

The effects of sex, verbal ability, test anxiety, ideational fluency 
and training procedures on Formulating Hypotheses test performance were 
studied. Training consisted of presentation of models of "acceptable" 
responses that stressed either quantity or quality of performance. Both 
the quantity and quality models were effective in modifying behavior in 
the expected direction. Ideational fluency was related to number of 
responses, and verbal ability was related to scores reflecting quality. 
Females v;ere in general superior to males with respect to scores re- 
flecting quantity of responses. Test anxiety was not significantly 
associated with performance. Weak evidence of treatment-anxiety and 
sex-vocebulary interactions was found. 



EFFECTS OF MODELS OF CREATIVE PERFOR>IANCE 
ON ABILITY :0 FORMULATE HYPOTHESES"^ 

Norman Frederiksen and Franklin R. Evars 
Educational Testing Service 

In a previous study (Klein, Frederiksen, & Evans, 1969), the role 
of anxrety in learning a task requiring "creative^* responses was investi- 
gated. The task involved the ability to think of hypotheses to account 
for research findings, using a test calle ' Formulating Hypotheses (FH) 
(Frederiksen, 1959). The training consisted of presenting a model con- 
sisting of ci list of "acceptable" responses after the completion of each 
item. The list coi tained 20 or more hypotheses, not all of which were 
necessarily of high quality. Students typically wrote only five or six 
responses; therefore the models v?re presumably perceived by S^s as 
emphasizing quantity more than quality of performance. 

The specific hypothesis investigated was that the performance of 
anxious students would improve more than that of less -anxious students 
as a result of the training. The idea was that performance on such a 
tc sk involves some self-censorship of ideas, especially by anxious people 
in a situation calling for creative responses, people are likely to have 
more ideas than they report because they censor ideas that they think 
aren^t "good enough" and therefore might lead to embarrassment. The 
feedback of models of "acceptable" responses, it was thought, would 
decrease the amount of censorship, thereby increasing the number of 
responses, and this effect would be greater for anxious than for less- 
anxious individuals. 



It was found that the presentation of lists of model responses did 
produce a significant increase in the number of hypotheses written. It 
did not, however, produce an increase in the number of acceptable hypotheses 
written, and there was no evidence of transfer to another task requiring 
divergent production of semantic material (Guilford's Consequences test). 
Since the interaction of treatment and anxiety was not significant, the 
hypothesis that treatment would be more effective for anxious ^s was not 
confirmed. The overall results were tentatively interpreted as showing 
that the models comprising the experimental treatment were effective in 
changing the S^'s standards as to what constitutes satisfactory perform- 
ance rather than his ability to deal with the problems. 

An unexpected finding was that the relationship of anxiety to the 
number of hypotheses produced was curvilinear; poorest performance was 
associated V7ith a middle level of anxiety and best performance with a 
low level of anxiety. This result is. not predicted by drive theory and 
is contrary to Stennett's (1957) suggestion that the relationship between 
performance and drive level is nonmonotonic with the shape of an inverted U. 

The purposes of the precent investigation were (1) to attempt to repli- 
cate the results of the earlier study, particularly the effects of quantity 
models on number of responses and the curvilinear relationship of anxiety 
to performance; (2) to see if ''quality*' models of performance that were 
intended to improve quality of performance would be successful; (3) to 
investigate sex differences in creative performance; (4) to investigate 
the interactions of treatments, sex, ability, and anxiety in relation 
to quantity and quality of creative responses; and (5) to attempt to 
learn something about the processes involved in the improvement of per- 
formance. 
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Me thod 

Subjects 

The Ss in this study were 395 paid volunteers, mostly freshmen, at 
two state colleges in Pennsylvania. Approximately half were males and 
half females. 

Measures 

Formulating hypotheses. Five of the dependent Treasures were pro- 
vided by a test called Formulating Hypotheses (FH), which is described 
more completely elsewhere (Frederiksen , 1959; Klein et al., 1969). Each 
item of the test consists of a graph or table showirg findings from an 
actual research investigation in an area of social science. One item, 
for example, presents a graph showing that- during a period following 
World War II employment decreased in mining and increased in all the 
other industries included in the survey; tihis finding is stated in words 
at the bottom of the graph. The S_'s task is to write hypotheses 
(possible explanations) that might account for, or help to account for, 
the finding. A sample item was presented showing examples of responses. 
Seven FH items were used instead of the eight in the earlier study. 

In the previous study, two scores based on FH were used as dependent 
variables; Number of Hypotheses and Number of Acceptable Hypotheses, and 
they were also used in the present investigation. Number of Responses 
(FH Score 1) is the total number of nonduplicate hypotheses written by an 
S, and Number of Acceptablfi Responses (FH Score 2) is a subset of FH Score 
1 responses that includes only hypotheses previously judged by a panel to 
be "acceptable. This score was used, in spite of its experimental depend- 
ence on FH Score 1, in order to make possible a more exact replication of 
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certain aspects of the previous study, and also because it reflects 
quality as well as quantity of ideas produced. 

FH Score 3 is a new score called Average Judged Quality of the re- 
sponses. It is the average of ratings made by two scorers (using a nine- 



FH Score 4 is called Average Scale Value. It is another quality score, 
obtained by a method that makes it relatively independent of such qualities 
as length, handwriting, or grairanatical correctness. The method made use of 
a master list of the hypotheses written by students, as derived from a con- 
tent analysis. A panel of judges made evaluations of the hypotheses on 
this list, and a scale value was assigned to each response on the basis 
of these ratings. The scorer's task was merely to decide which listed 
hypothesis, if any, was similar to the one being scored, and to record 
the number of that listed hypothesis. A computer later assigned the 
corresponding scale value to the response e 

FH Score 5 is Average Number of Words per response. It was included 
to provide some insight into the processes involved in modifying behavior 
by presenting models, and the cues used by scorers in evaluating responses. 

An attempt was made to develop another score that would represent the 
rarity, unusualness, or originality of responses. The method involved 
getting a weighted score based on the number of ideas that occurred at fre- 
quencies below three specified levels. It was found that such scores were 
unreliable, even though frequencies as high as 40% were used as the basis 
for keying. Therefore no originality score was inducted in the study. 



point s^'ale) of the quality of the responses, qualitv being defined in a 

/ 

sens^e that was consistent with the instructions to the examinees who took 





the FH test. 



Each of the five FH scores was obtained separately for each of ^.he seven 
FH items; this made it possible to obtain scores for subsets of it.ems. Two 
subsets were used: (1) The first two items (which were completed before 
any feedback materials were presented), and (2) the last five items (which 
were all susceptible to influences of the experimental treatments). The 
five FH scores based on Items 1 and 2 are called FH Pretest Scores and 
were used as covariates in the statistical analysis. The five FH scores 
based on Items 3 to 7 were used as dependent variables. Unit weights were 
used in obtaining the pretest scores. The weights used in obtaining the 
five-item composite scores were their loadings on the first principal com- 
ponent resulting from a principa] axes factor analysis of the intercorre- 
lations of the five items. Five such factor analyses were done, one for 
each of the five FH scores. This method yielded a score representing the 
common variance in each of the onci-f actor systems. (However, since the 
loadings were about equal, unit weighting would have served about as well 
in this instance.) 

Consequences test . ■ Consequences is one of the tests used by Guilford 
(1967) to measure divergent production. Each item presents a hypothetical 
situation (e.g., "What would be the results if people no longer needed or 
wanted sleep?"), and the task is to list possible consequences of that 
situation. Two scores were obtained, using Guilford's scoring methou: 
Consequences-Obvious and Consequences-Remote, representing Divergent Pro- 
duction of Semantic Units and Divergent Production of Semantic Transfor- 
mations, r^^spectively, in the structure-of-intellect model. These scores 
were used to measure transfer of experimental effects to another task in- 
volving divergent pr' luction and were treated as two more dependent vari- 
ables* 
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Cognitive tests . Two tests from the Kit of Reference Tests for Cogni- 
tive Factors (French, Ekstrom, & Price, 1963) were also used: Advanced 
'Vocabulary, a 36-itein multiple-choice synonyms test; and the Theme Test, 
which requires S to write two themes, the score being merely the number 
of words written. These two tests measure the factors of verbal, compre- 
hension and ideational fluency, respectively, according to the French, 
Ekstrom, and Price manual. In the structure-of-intellect model, the 
ITieme test represents Divergent Production of Semantic Units. 

Test anxiety . The same inventory used in the previous study pro- 
vided the Test Anxiety score; this scale contains items from Harleston's 
(1962) measure of test anxiety and the items from the Alpert-Haber (1960) 
debilitating anxiety scale. Since the Def ensiveness scale showed little 
relationship to performance in the previous study, it was not included 
in this investigation. 

Procedure 

The procedure used was very similar to that employed in the previous 
experiment, except that there were two experimental treatments instead of 

one, and the data were obtained in one long evening session rather than 

in three separate sessions. 

The experimental treatments consisted in providing models of accept- 
able performance, at the completion of each FH item (except the first), 
in the form of a list of hypotheses pertaining to that item. One treat- 
ment (the quantity treatment) was essentially the same as that used in 
the first study; it consisted of providing a fairly long list (18 to 26) 
of ^^acceptable hypotheses" illustrating ideas that S_ might have written 
in response to the preceding item. Tlie other treatment (quality) was 

ERiC 
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similar except that each list of acceptable hypotheses included only the 
best ideas, carefully worded. The quality list typically included only 
six or seven hypotheses, and they were somewhat lonj^er than those used 
foi the quantity models. 

Members of the control group received no models'; instead, they were 
givf n various questionnaires to occupy their time in what appeared to be 
a relevant way. 

The lists were intended to provide models of perfo^^mance that S^s 
would try to emulate in one way or another. Both the quality and quantity 
materials are believed to show a rather striking contrast to the work of 
most students: The quantity lists contained many more responses than the 
average S. wrote, whj le the quality lists were noticeably superior in quality, 
both with respect to ideas and wording. The materials (both quantity and 
quality) were represented to the S^s as responses written by college stu- 
dents that "were judged acceptable in that they give plausible explanation 

of the finding, although some... are no doubt better than others Tliese 

hypotheses are presented merely as examples of good responses by other 
students, in order to stimulate your thinking,** In order to insure that 
the lists were read, was instructed to study the list carefully, then 
to go back to his own list to make any revisions or additions he felt 
would improve the list. (The FH answer sheet produced a copy of S^'s 
responses. The original was removed before the feedback materials were 
presented, and the revisions were made on the copy. Only thR original 
was used in scoring,) 

All Ss from a given college were seated together m a large room, and 
all three treatment groups were handled simultaneously. Assignment of 
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treatments to Ss was accomplished by handing to every third S. as he entered 
the room an envelope containing materials for one particular treatment. All 
documents were numbered as shown in Table 1, and instructions were given by 
referring to document numbers. The document in use at a particular period 
was taken by S from the top of the pile in his envelope and was placed at 
the bottom of the pile in the envelope when completed. S^s knew that the 
materials were not identical for all S s , but they did not know the nature 
or purpose of the different treatments. 



Insert Table 1 about here 



Statistical Analysis 

Meana, standard deviations, and intercorrelations of all variables 
were computed for all Ss combined and separately for the three experimental 
groups. Similarly, means, standard deviations, and intercorrelations were 
computed for the seven FH items, once for each of the five FH scores. Re- 
liabilities were computed where possible. 

Because of the relatively large number of dependent measures, the 
method of analysis chosen was multivariate analysis of covariance (MANCOVA) 
(Clyde, Cramer, & Sherrin, 1966). There were seven dapendent measures: the 
five FH scores based on Items 3 to 7, Consequences-Obvious, and Consequences- 
Remote. Since there were small differences between tha two colleges in stu- 
dent performance, one covariate V7as the dichotomy college attended. Other 
covariates were the five FH Pretest Scores. These five scores are appro- 
priately used as control variables whenever we are interested in change 
in performance on FH (i.e., when investigating effects of treatments). 



T 
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The design factors were treatment, sex, vocabulary, ideational fluency, 
and rest anxiety. However, it was not possible to use all five design factors 
in one analysis because some cell frequencies became too small. Instead, 
three separate MANCOVA's were done with overlapping factors. The three 
analyses employed the following design ji'actors, with number of levels 
within each factor shown in parentheses: 

1. Treatments (3), sex (2), ideational fluency (2), test anxiety (3) 

2. Treatments (3), sex (2), ideational fluency (3), vocabulary (2) 

3. Treatments (3), sex (2), vocabulary (2), test anxiety (3) 
Three jevels of test anxiety and of ideational fluency were used in order 
to make possible the detection of nonlinear relationships. Ea^a of the 
three designs was used once with collefie attended as the only covariate, 
and once with the five FH pretest scores used as covariates in addition to 
college attended . 

The MANCOVA model employed in the analysis first renoves variance 

attributable to the four-way interaction, then lower-order interactions, 

and finally it deals with main effects. Each main effect was computed so 

that it was orthogonal to all other main effects and all interactions. Thus 
2 

the R may be interpreted as the percentage of variance uniquely attributable 
to the factor under consideration. 

Results 

Intercorrelations and Reliabilities 

Table 2 presents the means, standard deviations, intercorrelations, 
and reliabilities of the variables used as covariates and as design vari- 
ables, using data for all subjeccs combined. Since the treatments may 
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af feet intercorrelations involving dependent variables, the intercorre- 
lations of dependent variables are shown separately in Table 3 for the 
three treatment groups as well as for the total group; and in Table 4 
are shown the correlations of th -n ..^ variables with the other 

variables for the three treatment groups and for the total group. 

Insert Tables 2, 3, and 4 about here 

Reliabilities of the various measures are sho\m in the main diagonals 
of Tables 2 and 3. (Reliabilities are based on the total group.) The last 
five diagonal entries in Table 2 arc the correlations between the two 
items making up the FH Pretests, corrected for double length. The first 
five diagonal entries in Table 3 are the reliabilities for the five-item 
FH test, computed by obtaining the average of the item intercort 3lations 
and correcting for length by the Spearman-Brown formula. The correlations 
between the two-item and the five-item tests are shown in Table 4; these 
may be thought of as alternate form reliabiJ ities. These correlations are, 
of course, attenuated by the fact that one of the two tests contains only 
two items. FH Score 5 (Average Number of Words) is the most reliable, 
and FH Score 4 (Average Scaled Value) is the least reliable of the scores. 
While the two-item pretest is adequate for use as a control variable, a 
longer test would obviously have been better. The data indicate that 
highly reliable measures of FH performance can be built by using a 
sufficient number of items. 

Intercorrelations of FH test scores show that the FH Score 1 and 
FH Score 2 are highly correlated, as is to be expected since FH Score 2 
is based on a subset of the responses contributing to FH Score 1. The 
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two scores designed to measure quality~FH Scores 3 and 4— are also highly 
correlated, in comparison with their reliabilities, and Score 3 has a sub- 
stantial correlation with FH Score 2 (Number of Acceptable Hypotheses), 
which reflects quality as well as number of responses. The correlation 
between FH Score 3 (Average Rated Quality) and FH Score 5 (Average Number 
of Words) suggests either that raters tend to be impressed by long responses 
or that a more lengthy response is necessary for higher quality. 

Since scores on the five-item FH scores are influenced by the experi- 
mental treatments, it is important to look at their intercorrelations and 
correlations with other variables separately for the three treatment 
groups. Treatments might influence the correlations as well as the means; 
if the relationships were altered appreciably, the meaning of a score, in 
terms of its factorial composition, could be changed, which might make 
interpretation of the MANCOVA results misleading. 

Table 3 includes the intercorrelations ot the dependent variables for 
the three treatment groups. There were some differences in correlations 
that might be attributable to treatments. For example, correlations involv- 
ing FH Score 5 (Number of Words) were apparently reduced or made negative by 
the Quantity feedback. However, the differences are not great, and the 
pattern is generally the same for all three groups. The correlations with 
independent variables (Table 4) do not differ greatly for the three groups. 
It is therefore concluded that factor structure was not substantially 
altered by the treatments and MANCOVA results would, from this point of 
view, be interpretable. 

Canonical Vari ates 

Table 5 showc the correlations of the dependent variables with the 
four significant canonical variates (out of six computed) that were obtained 
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by using college attended and the five FH Pretest Scores as covariates, and 
without including any of the design factors. The canonical correlations 
(shown in the row labelled R) are correlations between the best-weighted 
combination of dependent variables and the best-weighted combination of 
covariates. These canoi-ical variates are orthogonal, and they result fron 
a step-down model (variance attributable to the first canonical variate is 
removed before computing the second, etc.). The correlations of dependent 
variables with the canonical variate may be used like factor loadings to 
interpret the canonical variater^. (There arc three sets of zero-order 
correlations and three R's for each canonical variate, which result from 
the analyses of the three different combinations of design factors described 
earlier. The reason for the slight differences among sets is that the pooled 
within-cell sums of squares differ slightly from one combination of design 
factors to another.) 

Insert Table 5 about here 

The first canonical variate was obviously defined by FH Score 5 (Average 
Number of Words). The canonical correlation between covariates and the de- 
pendent variables was high, about .75, no doubt in part because of the higher 
reliability of FH Score 5, which correlates about .98 with the canonical 
variate. The only other dependent variable with appreciable correlations 
was FH Score 3, the Average Quality Rating. These correlations suggest 
that there may be a tendency for raters to give higher ratings to the longer 
responses; or, possibly, longer responses are necessary for statements of 
high quality. 

Canonical Variate II was defined mainly by FH Scores 1 and 2 (Number 
of Hypotheses and Number of Acceptable Hypotheses). Sizable correlations 
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also occurred for Consequences-Remote , but not for Consequences-Obvious, 
which suggests that the quantity of production on the FH test involves 
divergent production of semantic transformations rather than semantic units, 

Canonical Variate III was mainly correlated with FH Score 3, the Aver- 
age Quality Rating. Other substantial positive correlations were found 
for FH Score 2 (Number of Acceptable Responses) and Hi Score 4 (Average 
Scale Value) reflecting the quality component in both these scores. In 
contrast to Canonical Variate II, the correlations with Consequences-Remote 
were negative, suggesting that the FH quality scores were quite different 
from the quantity scores with respect to the influence of fluency. 

The last canonical variate, which was barely significant, was corre- 
lated mainly with FH Score 4, the Average Scale Value. IH Score 4 is the 
least reliable FH score, and a good deal of the variance attributable to 
it had already been allocated to Canonical Variate III. Thus there appear 
to be three major components in the domain of the dependent variables: 
length of responses, number of responses, and quality of responses. 

When college attended was used as the only covariate, the canonical 
variate was significant (£ < .01), showing that college attended was sig- 
nificantly related to the dependent measures; but the canonical correlation 
was only about .25. The correlations with the canonical variate may be 
interpreted as showing that one college was superior with regard to the 
quality score (FH Score 3), and the other was superior on Number of 
Hypotheses (FH Score 1) and on Consequences-Remote. No hypothesis Cc.n 
be offered to account for these differences other than the possibility 
of some unintended difference in the conditions under which the tests were 
administered. The results do justify our use of college attended as a 
covariate. 
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Multivariate Analysis 

Tables 6, 7, and 8 present the salient findings of six MANCOVAs. 
The analyses differed with respect to tbe combination of design factors 
employed, as was described previously, ai^d with respect to the covariates 
used ( college attended onlv or college attended and the five FH Pretest 
scores). Table 6 shows the results for the design factors treatment, sex, 
ideational fluency, and test anxiety; Table 7 the results for treatment, 
sex, ideational fluency, snd vocabulary; and Table 8 the results for treat- 
ment, sex, vocabulary, and test anxiety. For treatment and for interactions 
involving treatment, the results are reported only for the analyses where 
FH Pretest scores are used as covariates. For the remaining design factors 
and interactions, results are reported for analyt where college attended 
is the only covariate. The FH Pretest scores are used as covariates for 
treatment effects becaufse we wish to use a measure of change in evaluating 
the effects of the quality and quantity models. Results for all the main 
effects are reported, but results for interactions are reported only if the 
significance level reaches the 5% level for either multivariate or univariate 
tests. The results shown for the main effects were computed in such a way 
that can be interpreted as the percentage of variance uniquely attributable 

Insert Table 6 about here 

to a particular factor. 

A word about the contents of the three tables: The first column indi- 
cates what main effect (or interaction) is described in the corresponding 
row of the table. The second column shows the overall multivariate F-ratio 
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and its related ^ and R values. (The second canonical variate was not 
significant in any instance.) Except in the case of an interaction, mean 
scores on the first canonical variate for the appropriate subgroups are 
shown in the next column. (The grand mean is set at zero. ) In the next 
column are shown the salient correlations of dependent variables with the 
first canonical variate; this information makes clear what constitutes 
each canonical variate and thus which dependent variables contribute most 
to the means shown in the preceding column. In the last column are shown 
the £ -values for univariate tests for the dependent variables shown in 
the preceding column. An entry was made in the last column if (1) the R 
with the canonical variate was > .30; or (2) the univariate significance 
level was _< .05. 

Results for treatment, sex, ideational fluency, and test anxietv . The 
largest R in Table 6 is .46, for ideational fluency, whose relationship to 
the first canonical variate was highly significant (^ < .001). The correla- 
tions of dependent variables with the canonical variate show that ideational 
fluency was related mainly to Consequences-Obvious, but also to Consequences- 
Remote and FH Scores 1 and (to some extent) 2. Univariate tests were signifi- 
cant for all these variables. The results appear to support Guilford's 
placement of both the Theme test and Consequences-Obvious in the same cell 
of the structure-of-intellect model, and they also show that the Number of 
Hypotheses score has a large component of divergent production of semantic 



units . 



The second-largest R is .37, for treatments. The five FH pretest scores 
11 as college attended were used as covariaLes for this design factor. 
(The effects of treatments were greater— R = .37 as compared with .31- -and 



as we 
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more clearly focused when these six covariates were used than when only 
college attended was used as a covariate, while the pattern of performance 
remained the same.) Quantity treatment improved performance as measured 
by FH Score 1 (Number of Hypotheses) and FH Score 2 (Number of Acceptable 
Hypotheses), while quality treatment tended to produce fewer, longer, and 
somewhat better responses. The univariate tests for the four FH scores 
were all significant, including those for FH Scores 3 and 5. A one-way 
analysis of variance, with FH Score 3 as the dependent variable and with 
college attended and FH Pretest Score 3 as covariates, showed that mean 
performance of the quality treatment group on FH Score 3 was signifi- 
cantly higher than fhat of the control group (F = 7.34; p < .007). Tnus 
both quality and quantity treatments produced the expected changes in 
performance. 

The R for sex is .26. Females were superior to males on a canonical 
variate that is positively correlated with Consequences-Obvious and FH Scores 
2 and 5, and negatively correlated with Consequences-Remote. Thus females 
were found to be superior vith regard to performance on tests that reflect 
number of hypotheses, number of words, and number of obvious consequences; 
but they were poorer on remote consequences. 

The multivariate £ for test anxiety was not significant (£ < .12). The 
means show that low anxiety Ss tended to be superior on a canonical variate 
that correlates most highly with Consequences-Remote and FH Score 2 (Number of 
Acceptable Hypotheses). The univariate test for Consequences-Remote was sig- 
nificant (£ .033). Thus high anxiety appears to have suppressed performance 
on an ability similar to divergent production of semantic transformations. 
The means on the canonical variate for low, middle, and high anxiety groups 
showed no evidence of a nonlinear relationship such as was found previously. 
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The MANOVA test for a treatment-anxiety interaction yielded a nonsig- 
nificant F . However, the univariate test for FH Score 4 was significant 
(£ < .014). Examination of the nine cell means for FH Score 4 shows a 
tendency for low- anxious S^s in the treatment groups to out-perform low- 
anxious Ss in the control group, while the opposite was true for high- and 
middle-anxious Ss. Thus either quantity or quality treatment tends to 
benefit low-anxious Ss more than middle- or high-anxious Ss. 

Results for treatment, se x, ideational fluency, and -ocabular y. 
Table 7 reports the MANCOVA results for four design variables that jnclude 
vocabulary. Verbal ability, as measured by the Vocabulary test score, was 
significantly related (R = .30, £ < .001) to a canonical variate that clearf.y 
reflects quality of performance on Fh and a tendency to write long responses. 
The univariate tests were all significant, and uhe relationship to verbal 
ability was positive, as is shown by the means. 



Insert Table 7 about here 



The results for treatments, sex, and ideational fluency were all quite 
similar to those shoim in Table 6, except that in the case of sex the uni- 
variate tests were not significant for FH Scores 2 and 5. Inclusion of 
vocabulary as a design variable has apparently removed some of the variance 
that was attributed to sex in the analyses reported In Table 6. 

Univariate tests show some evidence of an interaction of sex and vocab- 
ulary involving FH Scores 2, 3, and 4, the measures reflecting quality of 
performance on FH. The means on FH Score 2 in the 2 x 2 interaction table 
show that females generally earned higher scores, but the difference between 
males of high and low verbal ability was much greater than that between 
females of high and low ability. 
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Refiults for .reatment, sex, vocabulary, and test anxiety . Table 8 
presents results for the third combination of design variables, which pro- 
vides an opportunity to see if there is a vocabulary-anxiety interaction. 
None was found. The sex-vocabulary interaction did appear again, and in 
this analysis the multivariate test was significant (£ < .016) as well as 
the univariate tests. The univaric^te test for the treatment-anxiety inter- 
action was again significant for FH Score 4. Generally speaking, results 
for main effects were very similar to those found in the o^rher two analyses 
except for some differences in details of the canonical variate for sex that 
are attributable to the variations in design factors. 

Insert Table 8 about here 



Means and Intercorrelations of HI Items 

The rate of change in performance on the seven FH items is of interest 
because it might provide a basis for inferences about the processes involved 
in learning to formulate hypotheses under the experimental conditions. Two 
competing hypotheses are that (1) improvement reflects a change in ability , 
and (2) improvement reflects a change in standards as to what constitutes 
satisfactory performance. The first hypothesis implies a gradual process 
of learning, which would be reflected in a gradually rising curve; and the 
second would be more consistent with a sudden increase in performance 
following the first experience with one of the models. Although the shape 
of the curve could not rigorously demonstrate either process, evidence of 
either gradual or sudden improvement would make the corresponding hypothesis 
a bit more attractive. 

Another possible approach, one involving individual differences, is to 
examine the intercorrelations of the item scores to see if treatment groups 
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differ from the control group with respect to the pattern of intercorre- 
lations. Learning data are usually characterized by a simplex pattern (high 
correlations for adjacent trials and gradual reduction in correlation between 
trials as they become more widely spearated in time). If, on the other hand, 
the change in performance is sudden rather than gradual, one might expect 
low correlations of Items 1 and 2 with the remaining items, and high corre- 
lations between Items 1 and 2 and among Items 3-7, 

Both of these approaches were tried; it was concluded that the data 
are too unreliable at the item level to yield interpretable findings. 

Discussion 

The effects of treatments on change in perfomance were shown to be 
highly significant; the proportion of variance in the canonical variate 
accounted for by treatments is about .13, when pretest scores are used as 
covariates. The effect of quantity treatment was basically to increase the 
number of FH responses and decrease the average number of words per response, 
while the effect of the quality treatment was to increase the average number 
of woids per response, increase the quality of responses, and decrease the 
number of responses. The result of the earlier study is thus confirmed in 
that quantity models were found to increase quantity of performance. The 
effect of the quality treatment is smaller as judged by correlations of FH 
scores wit*h the canonical variate, but a separate one-way analysis of vari- 
ance confirms the finding that the quality models result in responses of 
higher quality. The change in performance tends toward a literal copy of 
the models in terms of number, length, and quality of responses. No evi- 
dence of transfer of training to the Consequences test was found. 
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Ideational fluency was found to account for a relatively large pro- 

2 

portion of the variance in the domain of dependent variables (R = .23) ; 

and the other ability measure employed, the Vocabulary test, was also sig- 

2 

nificantly related to performance (R = .09). These two ability measures 
predict quite different aspects of performance: ideational fluency is 
related to quantity of production (especially the Consequences test scores 
and FH Score 1), while vocabulary is related to quality of performance. A 
definition of creativity in terms of fluency would appear to be correct only 
if the quality of creative performance were ignored. 

2 

Sex was also found to be a significant factor (R = .07), although the 
proportion of variance contributed depends somewhat on what other design 
factors are included in the analysis (since sex is significantly correlated v/ith 
anxiety, vocabulary, and, especially, ideational fluency). Females were 
generally superior on a canonical variate that correlates positively with 
Consequences-Obvious and FH Scores 2 and 5. 

2 

Test anxiety accounted for only a small amount of variance (R = .04) ; 
the canonical variate primarily reflected the Consequences-Remote score. 
High anxiety was associated with poorer performance, and there was no evidence 
of a nonlinear relationship. 

A salient finding of the earlier study was a U-shaped relationship 
between test anxiety and Number of Hypotheses. A possible reason for the 
failure to replicate the curvilinear relationship is that the relationship 
of performance to anxiety is different for males and females (the previous 
study used only male subjects). Although a significant anxiety-sex inter- 
action was not found, a plot l,not shown) of FH Score 1 (the variable involved 
in the earlier study) against the three levels of test anxiety showed that 

ERIC 
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the same U-shaped curve existed for taale Ss, while for females the curve 
was more or less linear and descending (high anxiety associated with fewer 
hypotheses) . Thus the data at least suggest that the relationships for 
males are basically similar to those found previously and that sex differ- 
ences exist. 

The anxiety-verbal ability interaction found in the other study was 
also not replicated. 

Weak evidence of a treatment-anxiety interaction was found involving 
FH Score 4. Since the hypothesis that motivated the original study was that 
anxious individuals would profit more from the treatments than less-anxious 
people, and that hypothesis was not then confirmed, the finding of even a 
weak treatment-anxiety interaction is of interest— even though FH Score 4 
was involved rather than FH Score 1. A plot (not shown) of treatments against 
FH Score A means for the three levels of test anxiety showed relationships 
completely unlike those predicted for FH Score 1. The plot shows that for 
the control group, quality of performance was poorest for low-anxiety indi- 
viduals, while for both treatment groups performance, was higher for low- 
anxiety S^s. At higher levels of anxi3ty, the performance of control group 
members was superior to the treatment groups. Thus the relationship was 
the opposite of what had been predicted for FH Score 1. However, since 
FH Score 1 is a quantity score and FH Score A a quality score, the re- 
sults may not be inconsistent with the original hypothesis. 

A weak sex-vocabulary interaction was also found, the correlates of 
the canonical variate including FH Scores 2, 3, and A, and Consequences- 
Remote. The first three of these variables emphasize quality of performance 
on FH rather than pure fluency. Examination of appropriate plots showed that 



performance of females v/as generally superior, but there was relatively 
little difference between males and females of high ability while low- 
ability females were much superior to low-ability males. 

Formulating Hypotheses appears to possess appropriate psychometric 
properties for further explorations in the realm of creative performance. 
It possesses a certain amount of face validity, the iL3ms being concerned 
with interpretation of real data obtained in various kinds of scientific 
undertakings, and therefore may possesb certain advantages over such tests 
as "brick uses" and "consequences." The scores so far developed are reason- 
ably adequate from the standpoint of reliability, and the interitem corre- 
lations tend to be sufficiently high that one could build a test of almost 
any reliability he desires by increasing the number of items. The span of 
abilities covered by the present five scores appears to include quantity 
of performance, quality of performance, and length of responses. It would 
be desirable to add scores measuring rarity or originality of responses. 
The study provides some evidence of the construct validity of the test, 
since the scores generally relate to other measures and to treatments in 
ways that are logical or in accordance with theoretical expectations. 
The use of tests like FH may be useful as provisional criterion measures 
in investigations of scientific creativity — i/fes trainability , the influences 
of situational factors, and the cognitive, attitudinal, and temperamental 
characteristics associated with it. 
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Table 1 
Sequence of Presentations 



Document 
Number 



Control 



Group 
Quality 



Quantity 



Time (in 
minutes) 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 



Questionnaire 
Questionnaire 
Questionnaire 
Questionnaire 
Questionnaire 



Personality Inventory 
Advanced Vocabulary Test 
Theme Test 
FH Practice Item 
FH Item 1 
FH Item 2 
Quality Models 

FH Item 3 
Quality Models 

FH Item 4 
Quality Models 

FH Item 5 
Quality Models 

FH Item 6 
Quality Models 
FH Item 7 
Consequences Test 



Quantity Models 
Quantity Models 
Quantity Models 
Quantity Models 
Quantity Models 



un timed 
8 
8 

untimed 
10 
10 

7 
10 

7 
10 

7 
10 

7 
10 

7 
10 
20 
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Table 3 



Intercorrelations^ of 


Dependent 


Variables 


for Total Group and 






for the 


Three Treatment 


Groups 








FH 


FH 




FH 


FH 




Fli 


Cons . 


Conq . 




Score 1 


Score 


2 


Score 3 


Score 


4 


Score 


5 Obvious 


Remote 




.80^ 


79 




1 o 

-.18 


.01 




-.23 


.28 


.34 


rii bcore 1 








-.13 


-.05 




-.21 


. J4 


. 38 






.67 




-.16 


.03 




-.08 


o r\ 
. JU 


. 41 






. 73 




-.21 


.04 




-.30 


.30 


. 29 






. 0 / 




.43 


. 31 




-.07 


.22 


.19 


rti bcore I 








.47 


.29 




-.02 




. z4 










.49 


.40 




.10 




. 20 










.39 


.27 




-.22 


O 1 


. 15 










.60 


.53 




.33 


-.06 


-.14 


1711 C _ ^ _ O 

rH bcore 3 










.57 




.36 


-. Uo 


1 c 

-. 15 












.59 




.40 


-.04 














.41 




.21 


n o 

-. Uo 


-. 07 












.48 




.14 


-.04 


-.05 


III O \»\J 1. c ^ 














.18 


- .06 


- . 11 
















.15 


-.02 


-.01 
















.09 


-.01 


-.02 
















.87 


-.09 


-.06 


r n o L.U I c J 
















. 10 


- ns 

. u o 


















-.18 


.02 


















-.26 


-.12 


















.85 


.15 


Cons. Obvious 


















.12 




















.20 




















.10 


Cons. Remote 


















.75 




29.3 


18.9 




14.4 


17.4 




62.4 


42.8 


16.7 


Mean 


29.4 


18.7 




14.0 


17.3 




62.5 


43.3 


17.0 


27.1 


18.0 




14.8 


17.4 




65.6 


44.2 


16.7 




31.4 


19.9 




14.3 


17.5 




59.3 


40.9 


16.5 




8.0 


5.6 




2.7 


2.1 




19.5 


15.1 


8.0 


S.D. 


7.9 


5.9 




2.8 


2.2 




22.0 


14.0 


7.6 


6.5 


4.5 




2.8 


2.3 




17.9 


16.5 


8.4 




8.8 


6.0 




2.5 


1.9 




17.8 


14.7 


8.0 



The fir t entry in each cell is the correlation for the total proup. 
Ihe next: three entries r.re the correlations for control, quality, and ouantity 
treatment groups, in that order. N's for the four groups are, respectively, 
395, 134, 129, and 132. For the total group, an R of .10 or greater is 
significantly different from zero at the 5% level, .13 at the 1% level. For 
the treatment group, an R of approximately .17 is significant at the 5% level 
.22 at the 1% level. 



1-^9^" Reliabilities (shown in the diagonal) are the average item intercorrela- 

hKJC within the tota] group corrected for length by the Spearman-Brown formula. 
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Table 4 

Correlations of Independent Variables with Dependent Variables 
for Total Group and for the Three Treatment Groups^ 



Dependent Variables 



Independent 


FH 


FH 


FH 




FH 


Cons . 


Cons . 










Variables 


Score 1 


Score 2 


Score 3 


Score 4 


Score 5 


Obvious 




Mean 


S. 






.12 


.14 


.06 


.05 


.12 


.21 


-.07 


i 


c 

. J 




.5 


Sex 


.00 


.08 


. 11 


.08 


.20 


.38 


-.07 


1 


c 
. J 




.5 


.19 
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-.05 


.16 


. 15 


-.97 
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.5 




, 5 




• 20 
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-.01 


. 13 


- 04 
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.5 




,5 




-.03 
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-.05 


.02 


-.05 


.05 


-.07 


J** 


/, 


16 


.0 


Test Anxiety 


-. 11 


-.06 


.09 


.20 


.01 


.06 


-.27 




. J 


17 


.4 


.06 


-.05 


-.15 


-.11 


-.17 


. 10 


.04 


35 


.2 


16 


.2 




-.01 


-.16 


-.14 


-.09 


-.01 


. 00 


.06 


35 


.7 


14 


.0 




.17 


.22 


.18 


.13 


.12 


.05 


.13 


1 /. 
i^ 




4 


7 


Vocabulary 


. io 


. 23 


. 14 


.11 


.13 


.08 


.19 




Q 


4 


.7 


.25 


.19 


.14 


.12 


. 20 


.05 


.16 


lA 


.9 


5 


.0 




.13 


.24 


.29 


.18 


.02 


.03 


.03 


15 


.0 




5 




.32 


.23 


-.05 


-T.08 


.08 


.43 


.28 


151 


3 


35 


2 


Ideational 


. 36 


. 28 


.06 


-.06 


.15 


J\l 


.29 


153. 


5 


35 


6 


Fluency 


.32 


.23 


-.08 


-.12 


.06 


.47 


.30 


148 


1 


36 


9 




. 29 


. 17 


-.12 


-.04 


.02 


.37 


.24 


i jZ 


i 


32 


9 




. SI 


.38 


-.10 


.03 


-.15 


.23 


.30 


15. 


9 


5 


0 


FH Pretest 


.62 


.38 


-.16 


.02 


-.13 


. 26 


.35 


16. 


2 


5 , 


1 


Score 1 


.43 


.39 


.01 • 


.08 


-.10 


. 21 


.23 


16. 


0 


4 


9 




. 53 


.41 


-.14 


-.01 


-.24 


.22 


.32 


15, 




5. 


1 




.42 


.46 


.16 


.14 


-.02 


.18 


.20 


11. 


7 


4, 


5 


FH Pretest 


.48 


.41 


.03 


.12 


-.03 


.20 


.20 


11. 


9 


4. 


3 


Score 2 


.39 


.53 


.30 


.73 


.07 
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6 
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2 
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.08 


-.07 


9. 


3 


1. 


9 


Score 3 


.09 
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.54 
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-.02 
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3 


2. 


1» 




-.01 


.24 


.45 


.11 
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.05 


-.12 


9. 


4 


2. 


4 




.00 


.13 


.26 


.23 


.16 


.01 


-.01 


11. 


9 


2. 


6 


FH Pretest 


-.07 


.07 


.21 


.30 


.19 


.01 


.03 


12. 


2 


2. 


4 


Score 4 


-.03 


.22 


.42 


.33 


.17 


-.09 


-.12 


11. 


6 


3. 


0 




.08 


.13 


.15 


.00 


.12 


.14 


.08 


11. 


8 


2. 


5 




-.17 


-.01 


.33 


.13 


.74 


-.04 


-.04 


39. 


1 


12. 


4 


FH Pretest 


-.15 


.03 


.36 


.10 


.80 


.08 


.01 


39. 


1 


12. 


2 


Score 5 


-.13 


.07 


.32 


.18 


.70 


-.13 


-.06 


39. 


4 


13. 


1 




-.22 


-.11 


.30 


.10 


.74 


-.06 


-.08 


38. 


7 


12. 


0 



The first entry in each cell is the correlation for the total group. The next three 
entries are the correlations for control, quality, c 1 quantity treatment groups, in that order. 
N's for the four groups are, respectively, 395, 134, ^29, and 132. For the total group, an R 
of .10 or greater is significantly different from zero at the 5% level, .13 at the 1% level. 
For the treatment groups, an R of approximately .17 is significant at the 5% level, .22 at 
the 1% level. 
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Table 6 

Results of MANCOVA for Treatment, Sex, 
Ideational Fluency, and Test Anxiety 



Correlations p-values 



Effect 


Multivariate 
significance 
level 


Means 




with 
variate 




for 
imi vflTi ate 

tests 












FH Score 1 


.76 


.001 




F = 


4.377 


Control Group 


.03 


FH Score 5 


-.46 


.003 


Treatment 


P < 


.001 


Quality Treatment 


-.47 


FH Score 2 


.46 


.003 




R = 


.369 


Quantity Treatment 


.44 


Cons.-Obv. 
FH Score 3 


-.31 
-.23 


.065 
.030 












Cons. -Rem. 


-.49 


.013 




F = 


3.637 


Male 


-.27 


FH Score 2 


.45 


.023 


Sex^ 


P < 


.001 


Female 


.27 


Cons.-Obv. 


.44 


.024 




R = 


.'>60 






FH Score 5 


.43 


.030 












Cons .-Obv. 


.82 


.001 


Ideational 


F =13.654 


Low 


-.47 


FH Score 1 


.51 


.001 


b 

Fluency 


P < 


.001 


High 


.47 


Cons. -Rem. 


.48 


.001 




R = 


.462 






FH Score 2 


.33 


.001 












Cons. -Rem. 


.57 


.033 


Test 


F = 


1.461 


Low 


.26 


FH Score 2 


.50 


.104 


Anxiety^ 


P < 


.120(n.s.) 


Middle 


.01 


FH Score 5 


.36 


.259 




R = 


.220 


High 


-.27 


FH Score 1 


.31 


.337 


Treatment- 


F = 


1.274 












Test Anxiety 


P < 


.155(n.s.) 






FH Score 4 




.014 


Interaction^ 


R = 


.211 













College attended and the five FH Pretest scores uses as covariates. 
College attended used as the only covariate. 



-31- 



Table 7 

Results of MANCOVA for Treatment, Sex, 
Ideational Fluency, and Vocabulary 



Effect 


Multivariate 
significance 
level 


Means 


Correlations 
with 
canonical 
variate 


p— values 
for 
univariate 
tes ts 








FH Score 1 


.74 


.001 




F = 4.455 


Control Group .04 


FH Score 2 


.45 


.003 


^ a 
Treatment 


p < .001 


Quality Treatment -.46 


FH Score 5 - 


-.44 


.004 




R= .371 


Quantity Treatment .42 


Cons.-Obv. • 


-.32 


.061 


. , 






FH Score 3 ■ 


-.22 


.029 








Cons. -Rem. ■ 


-.54 


.009 




F = 3.286 


Male - . 24 


Cons.-Obv. 


.53 


.011 


b 

Sex 


p < .002 


Female . 24 


FH Score 2 


.35 


.094 




R= .248 




FH Score 5 


.33 


.115 








Cons.-Obv. 


.81 


.001 


J. UC dl. J,|J llcl± 




Low -.65 


FH Score 1 


.51 


.001 


Fluency 


P < .001 


Middle -.05 


Cons. -Rem. 


.46 


.001 




R = .497 


High .70 


FH Score 2 


.30 


.005 








ni Score 3 


.80 


.001 


Vocabulary^ 


F = 4.827 


Low -.28 


FH Score 4 


.66 


.001 


p < .001 


High . 28 


FH Score 2 


.58 


.001 




R - .296 




FH Score 5 


.51 


.003 








FH Score 2 


.64 


.032 


Sex- 


F = 1.598 




FH Score 4 


.61 


.041 


Vocabulary 


p < .135(n. 


s.) 


FH Score 3 


.60 


.043 


In te raction*^ 


R = .176 




Cons. -Rem. 
Cons .-Obv. - 


.45 
.33 


.132 
.271 


College 


attended and 


the five FH Pretest scores 


used as covariates. 





ERIC 



College attended used as the only covariate. 



Table 8 

Results of MAI^COVA for Treatment, Sex, 
Vocabulary, and Test Anxiety 



Effect 


Multivariate 
significance 
level 


Means 




Correlations 
with 
canonical 
variate 


p-valuej; 
for 
univariate 
tes ts 












FH Score 1 


.80 


.001 


Treatment^ 


F 


= 4.162 


Control Grotin 


.01 


FH Score 2 


.48 


.002 


P 


< .001 


Oualitv Trpatmpnt — 


.46 


FH Score 5 - 


.48 


.002 




K 


Oil Q 




45 


FH Score 3 - 


.26 


.027 












Cons.-Obv. 


.67 


. UUi 




F 


= A.7A2 


Male 


.30 


FH Score 2 


.46 


.008 


, b 
Sex 


P 


< .001 


Female 


.30 


FH Score 1 


.40 


.020 




R 








FH Score 5 


.34 


.049 












FH Score 3 


.78 


. U'Ji 




F 


= 5.128 


Low — 


.31 


FH Score 4 


.65 


.001 


b 

Vocabulary 


P 


< .001 


High 


.31 


FH Score 2 


.58 


.001 




R 


= .304 






FH Score 5 


.46 


.005 












Cons. -Rem. 


.59 


.032 


Test 


F 


= 1.415 


Low 


.25 


FH Score 2 


.42 


.219 


Anxiety 


]\ 


< .140 


Middle 


.02 


FH Score 1 


.32 


.324 




R 


= .214 


High 


.23 


FH Score 4 - 


.30 


.336 












Cons.-Obv. - 


.53 


.026 


Sex- 


F 


= 2.498 






FH Score 2 


.52 


.029 


Vocabulary 
Inte raction 


P 


< .016 






Yli Score 3 


.51 


.033 


R 


= .218 






FH Score 4 


.50 


.037 


Treatment- 


F 


= 1.261 












Test Anxiety 


P 


< .164(11.3.) 






FH Score 4 




.012 


Interaction 


R 


= .216 













College attended and the five FH Pretest scores used as covariates. 
College attended used as the only covariate. 



